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Abstract 


Density functional theory (DFT) is the most versatile electronic structure method used in quantum chemical 
calculations, and is increasingly applied in astrochemical research. This mini-review provides an overview of the 
applications of DFT calculations in understanding the chemistry that occurs in star-forming regions. We survey 
investigations into the formation of biologically relevant compounds such as nucleobases in the interstellar 
medium, and also cover the formation of both achiral and chiral amino acids, as well as biologically relevant 
molecules such as sugars, and nitrogen-containing polycyclic aromatic hydrocarbons. Additionally, DFT 
calculations are used to estimate the potential barriers for chemical reactions in astronomical environments. We 
conclude by noting several areas that require more research, such as the formation pathways of chiral amino acids, 
complex sugars, and other biologically important molecules, and the role of environmental factors in the formation 
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1. Introduction 
1.1. Astronomical Detection 


A plethora of biologically relevant compounds, such as 
amino acids, nucleobases and sugar derivatives, has been 
identified in samples of meteorites, comets and asteroids 
(Burton et al. 2012). In addition, a number of biologically 
relevant molecules have been observed in the interstellar 
medium (ISM) (Guelin & Cernicharo 2022). These findings 
strongly suggest an extraterrestrial origin for the elemental 
building blocks of life on Earth and raise intriguing questions 
about the formation of these molecules in space and their 
delivery to planets (as depicted in Figure 1). Addressing these 
questions is crucial for understanding abiogenesis and the 
search for life in the vast expanse of the Universe. To date, over 
200 molecular species have been detected in the interstellar and 
circumstellar medium (Mcguire 2022). Most of these molecules 
are organic in nature due to the fact that H, C, N and O are the 
most abundant elements in the Universe. Although the majority 
of these organic molecules are abiotic, a reaction between them 
presents a vast variety of potential complex organic molecules 
(COMs) which could be employed by life. 

Despite the long-held belief that the space environment is 
inimical to the formation and survival of COMs, the emergence 
of astronomical spectroscopy techniques has provided evidence 
to the contrary (López-Sepulcre et al. 2019; Li 2020; Li et al. 
2021; Lei et al. 2022). For instance, COMs were detected in 
various interstellar regions, some of which demonstrate a high 


degree of organic chemistry. Examples include formamide 
(H2NCHO) and other amides, which were identified in Sgr B2 
and Orion KL (Rubin et al. 1971; Suzuki et al. 2018). These 
molecules contain an amino group, which is analogous to the 
amino acids found in peptide chains, and are thus considered 
precursors of the genetic material of life. Another type of 
astronomically detected compound associated with life is 
nitriles, a class of molecules containing one or more nitrile 
functional groups. These molecules act as raw materials for the 
formation of ribonucleic acid (RNA). Acrylonitrile (HD? CCHCN) 
is an example of a nitrile, which was detected in the formation 
region of a massive star (Suzuki et al. 2018). 

The detection of COMs presents several challenges 
compared to simpler molecules like CO, CS and HCN. The 
criteria for COM detection have been summarized by Snyder 
et al. (2005), considering factors such as low abundance, 
complex energy levels and line confusions caused by similar 
rest frequencies. Among COMs, amino acids are of particular 
interest in the context of the origin of life in the Universe, but a 
reliable detection of amino acids has not been achieved to date. 
The simplest amino acid, glycine, has been reported as a 
detection in several sources (Kuan et al. 2003), but it has been 
pointed out that these lines are not actually from glycine 
(Snyder et al. 2005). Significant discoveries of new COMs in 
recent decades include molecules with peptide-like bonds such 
as formamide (Rubin et al. 1971), acetamide (Hollis et al. 
2006), N-methylformamide (Belloche et al. 2019) and 
propionamide (Li et al. 2021). Chiral molecules like propylene 
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Figure 1. Schematic for the plausible role of interstellar biomolecules in the emergence of life, through the bombardment of the planets by meteorites in the solar 
system. The background figure is from NASA’s image library with general permission to reuse. 


oxide (McGuire et al. 2016), sugar-type molecules like 
ethylene glycol (Hollis et al. 2002) and branched alkyl 
molecules like isopropyl cyanide (Belloche et al. 2014) have 
also been detected, marking important discoveries in the field 
of astrochemistry. However, fundamental questions about the 
formation of these COMs in space, their ability to withstand 
radiation and their mechanisms of transport to planets or 
through cosmic history remain largely unexplored. Answers to 
these questions hold the potential to eventually unveil the 
origins of life, potentially discovering extraterrestrial life and 
identifying habitable environments for humanity. 


1.2. Chemistry in the ISM 


The reactive routes remain as key questions after the detection 
of raw materials for the formation of biomolecules in space 
(Aponte et al. 2017; Barone & Puzzarini 2022; Rimola et al. 
2022). Due to the molecular complexity, biomolecules are 
generally difficult to form through direct collisions between 
molecules of closed shells in the cold ISM environment. 
Photochemical and radiation chemical processes are thus often 
considered to be the primary steps in the synthesis of interstellar 
COMs (Hollenbach & Tielens 1999; Castellanos et al. 2018; 
Arumainayagam et al. 2019). Laboratory studies of astronomical 
ice analogs suggest that there may be a complex chemistry on 


the ice surface exposed to radiation in star-forming regions 
(Materese et al. 2021). Ice mantles are supposed to concentrate 
molecules, while the radiation from young stellar objects can 
excite molecules, leading to barrierless reactions, as demon- 
strated in Figure 2. However, it could be challenging to generate 
meaningful quantities of biomolecules on ice mantle, due to the 
short lifetime of excited states and the photoinstability of the 
intermediate and product species. 

In the process of molecular clouds collapsing to form a 
star, the temperature of the gas surrounding the young stellar 
object increases passively due to the protostar’s radiation. 
When the temperature is sufficiently high to evaporate the ice 
mantle, warm molecular gases are observed at temperatures 
of 100—300 K, known as hot cores. These hot cores display a 
rich organic chemistry with the majority of observed O`, N^ 
and S. containing COMs detected in regions such as Sgr B2 
and Orion KL (Herbst & van Dishoeck 2009; Herbst 2014). 
Additionally, the protoplanetary disk, where planets are 
formed, has been a focus of research concerning abiogenesis. 
However, despite only relatively simple organics having been 
detected in protoplanetary disks (Oberg et al. 2020), the gas 
temperature can reach comparable levels to those of hot 
cores, as the disk temperature increases from 10 to several 
thousand K from the outer layer to the central forming star 
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Figure 2. Schematic for reactions in space occurring in the gas phase (top), and on the surface of grains (bottom). The background figure is from NASA’s image 


library with general permission to reuse. 


(Akimkin et al. 2013). In hot cores and protoplanetary disks, 
molecules at relatively high density (typically 10’-10° cm ? 
in these regions can effectively screen out ultraviolet (UV) 
radiation, and provide the kinetic energy necessary to 
overcome the barriers for synthesizing COMs free of 
photochemical and radiation chemical processes. 

The chemistry of COMs in the gas-phase ISM is less studied 
than chemistry on ice by experimental means (Fuente et al. 
2019; Herbst 2021; Puzzarini 2022). There is still debate over 
the main reaction mechanism that leads to their formation. The 
majority of COMs with direct biological interests are complex 
molecules (210 atoms) and the energy barrier for their 
synthesis from closed-shelled species is usually very high, 
meaning high temperature is needed to activate the process. As 
a result, previous studies have predominantly focused on ion- 
molecule reactions or neutral-neutral reactions involving highly 
reactive radicals (Sandford et al. 2020). However, this 
viewpoint does not entirely align with the fact that ions and 
radicals exhibit short lifetime, and with the astronomical 
observation that the majority of interstellar molecules detected 
so far are neutral species with closed shells (Mcguire 2022). 

To this end, researchers have explored catalyzed reactions as 
an alternative approach to synthesize complex interstellar 


molecules. However, the limited abundance of suggested 
catalysts often renders catalyzed reactions seemingly impos- 
sible in the ISM, since they require the simultaneous collision 
of three bodies (Mendoza et al. 2004; Roy et al. 2007; Rimola 
et al. 2012; Saladino et al. 2013; Vinogradoff et al. 2015; da 
Silva & de Araujo 2017; Qi et al. 2018; Potapov et al. 2019; 
Hanine et al. 2020). Fortunately, this is not always the case, 
particularly when the concentration of the catalyst significantly 
exceeds that of the reactants, as is the case for neutral atomic 
hydrogen (H I). It has been discovered that the presence of HI 
leads to alternative pathways that substantially decrease the 
energy barriers for complex reactions, resulting in a significant 
enhancement of the reaction rate (Yang et al. 2023). 
Consequently, these reactions become thermodynamically 
feasible in certain regions of the ISM. This revolutionary 
finding suggests that H I-catalyzed reactions could represent a 
significant mechanism for the interstellar formation of COMs 
from species with closed shells. 


1.3. Focus of this Review 


It is essential to ascertain the intermediates in the formation 
pathways of interstellar COMs to understand their formation 
mechanisms. However, many of these intermediates are highly 
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reactive and short-lived, making their detection difficult in 
laboratory experiments. To this end, chemical networks are 
developed to model the nature and abundance of interstellar 
molecules, with the help of spectroscopic observations and 
laboratory experiments (Quan et al. 2010; Herbst & Yates 
2013; Zhang et al. 2020a, 2020b; Unsleber & Reiher 2020; 
Zhao et al. 2021). While this approach has been successful in 
studying many simple interstellar molecules, it has not been 
able to explain the observed abundances of COMs. A viable 
solution to this problem is to perform quantum chemical 
calculations (QCCs), in which molecular geometry optim- 
ization via ab initio electronic structure methods can be used to 
determine the intermediates. 

In this concise review, we provide an overview of dedicated 
QCCs that employ density functional theory (DFT) to model 
the synthesis of biologically relevant COMs in the ISM, with a 
primary emphasis on gas-phase chemistry. It is worth noting 
that previous reviews by Zamirri et al. (2019) have covered the 
formation of interstellar COMs with a focus on reactions 
occurring on ice surfaces. Sandford et al. (2020) and Jørgensen 
et al. (2020) have provided comprehensive reviews of studies 
concerning interstellar biomolecule formation, primarily 
focusing on laboratory experiments. 


2. Density Functional Theory 


QCCs have been utilized in the assessment of electronic 
contributions to physical and chemical properties of molecules 
(Mata & Suhm 2017; Sumiya et al. 2022). This involves the 
systematic application of approximations to the electronic 
Schrédinger equation, with the aim of obtaining electron 
densities and energies related to the positions of the nuclei and 
the number of electrons. QCCs are classified according to the 
type of approximation used to solve the Schródinger equation. 
The most commonly used approaches are DFT, Hartree-Fock 
(HF) and their extensions. The HF method is the simplest wave 
function based approach. It neglects many-electron correlations 
and thus has limited accuracy in the description of chemical 
bond formation. Post-HF methods, such as Mogller-Plesset 
second order perturbation (MP2) (Møller & Plesset 1934) and 
coupled cluster single, double and perturbative-triple electronic 
excitations (CCSD-(T)) (Gyevi-Nagy et al. 2020), improve the 
wave function to recover the missing electron correlation, 
providing increased accuracy, albeit at the cost of reduced 
computational efficiency. 

DFT is now widely regarded as the most versatile electronic 
structure method in QCCs, owing to its high computational 
efficiency (Jones 2015). In the Kohn-Sham theory, DFT 
reduces the many-body problem of interacting electrons in an 
external field to a problem of noninteracting electrons moving 
in an effective potential with the exchange and correlation (XC) 
interactions. Despite its early limitations, DFT has become 
increasingly accurate for QCCs since the 1990s, following the 
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refinement of its approximations in order to better describe XC 
interactions. 

The Local Density Approximation (LDA) in DFT assumes 
that the XC energy functional is solely based on the electron 
density at each point in space. This has been shown to lead to 
an underestimation of exchange energy and an overestimation 
of correlation energy. To address this issue, Generalized 
Gradient Approximations (GGAs) have been proposed, which 
incorporate higher-order derivatives of the electron density. 
Despite its widespread use, DFT at the GGA level has been 
known to have difficulty accurately describing intermolecular 
interactions, such as van der Waals dispersion, charge transfer 
excitation and transition states. 

The difficulties associated with DFT in describing the 
exchange interaction can be addressed by the incorporation of a 
contribution calculated from the HF theory, known as hybrid 
functionals. An example of this is the popular Becke, 
3-parameter, Lee-Yang-Parr (B3LYP) functional, which is 
based on a linear combination of the HF exact exchange 
functional and the GGA. An illustrative example of the 
application of this approach to the abiogenesis question is the 
determination of the abiotic synthesis pathway of adenine 
(Ade) from HCN. Experiments have demonstrated that Ade can 
be synthesized from a solution of HCN and ammonia (NH3) 
since 1960 (Oró 1960). However, the possible pathways of this 
reaction remained unknown until Roy et al. (2007) performed 
calculations at the B3LYP DFT level. 

It has been suggested that meta-hybrid GGA functionals, 
such as Minnesota ones, could be potentially more accurate 
than the popular B3LYP (Hohenstein et al. 2008; Riley et al. 
2010; Peverati & Truhlar 2011; Ferrighi et al. 2012). This 
family includes the M06-L, M06, M06-2X and M06-HF 
functionals, each with a different amount of exact exchange. 
Their expansion terms are dependent on the electron density, 
the gradient of the density and the second derivative of the 
density. This suite of functionals has demonstrated a successful 
record for systems containing dispersion forces, thus correcting 
one of the most prominent insufficiencies of standard DFT 
methods. 

The accuracy of DFT is dependent on the choice of XC 
functional and the representation of the electronic wave 
function (namely basis set), which can be composed of either 
atomic orbitals or plane waves. The former is commonly 
used for quantum chemistry applications. The most prevalent 
DFT functional/basis set combinations for astrochemistry 
include B3LYP/6-31g*, B3LYP/6-311--g*, B3LYP/6-311g 
(d,p), B3LYP/aug-cc-pVTZ, M06-2X/6-31g, M06-2X/6- 
311g(d,p) and M06-2X/aug-cc-pVTZ, and their derivatives. 
There are deficiencies associated with the most popular 
B3LYP/6-31G* model, such as missing London dispersion 
and basis set superposition error, which have been 
discussed in Kruse et al. (2012) with regard to DFT 
calculations of molecular thermochemistry. Additionally, a 
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Figure 3. Chemical structures of the five primary nucleobases. 


review by Mardirossian & Head-Gordon (2017) provides a 
comparison between different levels of DFT, benchmarking 
ca. 200 density functionals on a molecular database of 
almost 5000 data points of non-covalent interactions, 
isomerization energies, thermochemistry and barrier heights. 

Although DFT has proven to be a valuable tool, it possesses 
certain limitations (Cohen et al. 2012). One significant 
challenge in studying chemical reactions is accurately 
describing reaction barriers and energetics. For certain systems, 
DFT often struggles to predict activation energies with 
precision, especially for reactions involving complex transition 
states or significant rearrangements of molecular structures. 
This limitation arises from the inherent approximation in the 
exchange-correlation functional, leading to errors in energy 
barriers. Additionally, handling solvent effects can be 
challenging, although this is less relevant in the low-density 
environment of the ISM. To address solvent effects accurately, 
alternative methods like implicit solvent models, explicit 
solvent molecules or hybrid quatumn mechanics /molecular 
mechanics models may be necessary (Senn & Thiel 2009; 
Acevedo & Jorgensen 2010). Moreover, DFT encounters 
difficulties when dealing with reactions involving large 
systems. Despite being more computationally efficient than 
most wave function-based methods, DFT calculations can be 
demanding for larger molecular systems, such as mixtures 
containing more than ten molecules. In such cases, molecular 
dynamics simulations could be more suitable than QCCs 
(Wang 2019, 2020, 2023). However, by carefully considering 
the limitations and implementing appropriate strategies, DFT 
can still be effectively utilized as a powerful tool for studying 
interstellar chemical reactions (Jones 2015). 

The accurate computation of electron density and free energy 
of bonding molecules has made DFT a powerful tool for 
exploring the reactive free-energy surfaces. Through the DFT 
structural optimization approach, it is possible to predict the 
intermediates and products that constitute a reaction pathway. 
This process involves updating the positions of the nuclei in 
colliding molecules toward the minimum of the potential 


energy, until optimal geometries of the molecules are found at a 
stationary point on the system's potential energy surface (PES), 
where the net interatomic force on each atom is acceptably 
close to zero (Althorpe & Clary 2003). To confirm the 
intermediate and transition state, vibration frequency calcula- 
tions are usually performed. Additionally, intrinsic reaction 
coordinate (IRC) calculations can be applied to ensure the 
transition states reasonably connect the reactants and products. 

The following section will survey DFT studies on the 
interstellar synthesis of biologically relevant COMs. The 
content will be organized by the categories of different 
biomolecules, including nucleobases, amino acids, sugars and 
other species. 


3. Nucleobases 


Nucleobases, nitrogen-containing compounds that make up 
the basic components of nucleic acids, include five primary 
nucleobases: adenine (Ade, C5H5N;), cytosine (Cyt, C4H5N30), 
guanine (Gua, CsH;N;O), thymine (Thy, CsH6N202) and uracil 
(Ura, C4H4N5O»). The three pyrimidine bases, Cyt, Thy and 
Ura, are composed of a single ring structure, while the two 
purine bases, Ade and Gua, have a fused-ring configuration, as 
illustrated in Figure 3. These five primary nucleobases serve as 
the fundamental units of the genetic code, with Ade, Gua, Cyt 
and Thy present in deoxyribonucleic acid (DNA) and Ade, Gua, 
Cyt and Ura present in ribonucleic acid (RNA). While direct 
observations of primary nucleobases in the ISM remain elusive, 
various sets of Ade precursors have been identified. These 
include aminoacetonitrile H2NCH5;CN (Belloche et al. 2008), 
carbodiimide HNCNH (McGuire et al. 2012), cyanamide 
H3NCH (Turner et al. 1975), cyanomethanimine HNCHCN 
(Zaleski et al. 2013) and glycolonitrile HOCH2CN (Zeng et al. 
2019). Additionally, hydroxylamine (NH2OH) is a known 
precursor for pyrimidines and purines (Rivilla et al. 2020), while 
ethanimine (CH3;CHNH) may play a role in amino acid 
synthesis (Loomis et al. 2013). 
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3.1. Pyrimidine Bases 


Most studies on the interstellar formation of pyrimidine 
nucleobases focus on routes from pyrimidine (C4H4N5), due 
to its chemical structure being directly relevant to that of the 
nucleobases. In the astronomical context, pyrimidine is known 
to be a precursor of nucleobases in interstellar ice analogs. 
Using laboratory simulations of interstellar environments, 
Nuevo et al. (2009) observed the formation of Ura in the 
residue generated from the UV photolysis of pyrimidine in 
pure water ices. To explore the mechanism of the reactions, 
DFT at the B3LYP level was employed to aid in the 
identification of a number of key intermediates (Bera et al. 
2010). Oba et al. (2019) reported the presence of pyrimidine 
and purine nucleobases in interstellar ice analogs composed of 
H20, CO, NH, and CH3OH after exposure to UV irradiation 
followed by thermal processes. Laboratory experiments 
demonstrate that the UV photoprocessing of pyrimidine 
and purine in simple ices of astrophysical interest can result 
in the formation of all five primary nucleobases (Materese 
et al. 2018). 

Cole et al. (2015) utilized an ion trap mass spectrometer and 
DFT calculations at the B3LYP level to reveal an interesting 
connection between the pyrimidine nucleobase anions and 
cyanate (OCN ). Similarly, Gupta et al. (2013) employed the 
same theoretical method to propose two exothermic formation 
pathways for Cyt, beginning from propynylidyne (CCCH) and 
cyanoacetylene (HCCCN). Choe (2021) used DFT calculations 
to investigate synthesis pathways toward Cyt, Ura and Thy on 
icy grain mantles, through reactions between HCCCN, 
protonated isocyanic acid (HoNCO *), and one of NH3, H5O 
or CH3OH. Furthermore, Choe (2020) proposed a formation 
route of a Cyt resulting from the reaction of urea with a 
protonated cyanoacetylene (CAH *) and H50. 

Pyrimidine has been found to be unstable under UV 
radiation (Peeters et al. 2005), leading to the consideration of 
alternative routes for nucleobase formation. Formamide 
(HoNCHO) has been proposed as a reactant in this context 
(Rotelli et al. 2016; Saladino et al. 2016), and it has been 
demonstrated that the polymerization of its dehydration 
products can lead to nucleobase formation (Nguyen et al. 
2015; Jeilani et al. 2016). DFT studies conducted at the 
B3LYP/6-311----G(d,p) level have suggested that the 
reactions of free radicals such as CCCNH or CCCO may lead 
to the formation of pyrimidine bases (Wang & Bowie 2012). 

Using DFT calculations at the M06/6-31+G(d,p)/6-311+ 
+G(d,p) level, Lu et al. (2021) investigate the gas phase 
reaction between partially dehydrogenated formamide and 
vinyl cyanide (H;CCHCN) for synthesizing Cyt, Thy and Ura 
via 1H-pyrimidin-2-one (C4H4N5O), which acts as a direct 
nucleobase precursor. Their results indicate that the H 
migration is the rate-determining process in the synthesis of 
the pyrimidine bases. However, it should be noted that highly 
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dehydrogenated free radicals may be unstable in interstellar 
molecular clouds due to their high chemical reactivity and 
consequent short lifetime. Moreover, an interesting approach of 
automated path search was used by Komatsu & Suzuki (2022) 
to study the synthesis of Cyt from NH;CCHO, based on DFT at 
the levels of UB3LYP and UM06-2X. 


3.2. Purine Bases 


The formation of purine bases Ade and Gua is more complex 
in comparison to the pyrimidine bases. Laboratory experiments 
have employed photochemical reactions of purine in interstellar 
ice analogs to synthesize Ade, Gua and their derivatives 
(Materese et al. 2017, 2018), with 2-aminopurine and 
isoguanine hypothesized to be two key intermediates. DFT 
calculations utilizing the B3LYP functional with the cc-pVDZ 
basis set have been performed to model this reaction (Bera 
et al. 2017). The results of the calculations suggest a multistep 
reaction mechanism involving a purine cation, hydroxyl and 
amino radicals, as well as water and ammonia. The effect of the 
ice surface was mimicked in the same work using a conductor 
solvent model. 

The pathway of HCN pentamerization has been the focus of 
many follow-up works. For example, Gupta et al. (2011) used 
DFT at the B3LYP/6-31G** level to form Ade from HCN, 
HCCN, NH2CN and CN, and estimated rate coefficient up to 
8.71 x 10? em? s^! Jung & Choe (2013) applied DFT at the 
B3LYP/6-311G(2d,d,p) level to simulate the Ade synthesis by 
oligomerizations of HCN, however reporting a high barrier of 
this reaction (a few hundred kJ/mol). Cole et al. (2015) relied 
on a combination of experiments and DFT at the B3LYP/6- 
311+-+G(d,p) level to study the formation of the purine bases 
in a reversed manner, by measuring the products of dissociating 
Ade and Gua. This revealed a number of interesting products, 
including carbodiimide (HNCNH), cyanamide (NH2CN) and 
isocyanic acid (HNCO). Additionally, Merz et al. (2014) 
proposed a synthesis pathway toward Ade from CCCNH and 
HNCNH and its isomer HNCN using MP2 calculations. 

Wang et al. (2013) proposed a synthesis pathway from 
formamide to Ade using DFT at the B3LYP/6-311G(d,p) 
level. Additionally, Choe (2018) utilized DFT at the B3LYP/ 
311G(2d,d,p) level to suggest formation pathways of Gua from 
4-aminoimidazole-5-carbonitrile (AICN). In these reactions, 
H2O was considered as a catalyst, leading to substantial 
decreases in activation barriers. Jeilani et al. (2018) further 
studied the free radical routes to the formation of Ade and Gua 
by DFT at the UB3LYP/6-311G(d,p) level, and identified two 
key intermediates, 5-aminoimidazole-4-carboxamide (AICA) 
and 5-(formylamino) imidazole-4-carboxamide (fAICA). The 
B3LYP results were compared to M06 data, and it was found 
that the results were comparable. 
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Figure 4. Molecular representation of the two enantiomers of Ala. 


4. Amino Acids 


Alpha-amino acids, which constitute proteins in the genetic 
code, have been detected in meteorites and comets, making 
them of particular interest in astrobiology. All of the alpha 
amino acids in the genetic code, with the exception of glycine 
(Gly, CoH5NO)), are chiral in nature. That is, these molecules 
exist in two stereoisomers that are mirror images of each other, 
referred to as enantiomers. In Figure 4, the example of Ala 
(C3H7NO;) is shown, with its two enantiomers: D-Ala and 
L-Ala, which possess a typical chirality center represented by a 
central carbon atom bonded to four distinct groups. 

The origin of biological homochirality remains an 
unresolved question, particularly regarding the chemical 
processes responsible for the chiral characteristics of alpha- 
amino acids in the context of astrochemistry. Previous 
investigations into the interstellar synthesis of amino acids 
have mainly concentrated on Gly due to its relative simplicity. 
The proposed mechanisms for its formation often involve 
Strecker-type synthesis, as well as interactions between radicals 
and radicals or radicals and neutral species. 


4.1. Glycine 


In Bernstein et al. (2002), experiments were conducted to 
explore the formation of Gly and other amino acids when 
cryogenic water ice containing small amounts of CH;0H, NH; 
and HCN were irradiated with UV light. Woon (2002) evaluated 
the viability of these pathways by employing QCCs at the MP2 
level with aug-cc-pVDZ basis sets, finding that isotopic 
substitution experiments would identify CH30H as the source 
of the C in the COOH carboxylic acid group of the amino acids. 
Subsequently, Holtom et al. (2005) conducted a combined 
experimental and theoretical study on the formation of Gly and 
its isomer in interstellar ices, employing density DFT at the 
B3LYP/6-311G(d,p) level. This work revealed that H atoms 
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with sufficient kinetic energy could overcome the entrance 
barrier to add to a CO», yielding a trans-hydroxycarbonyl radical 
which then could recombine with other radicals to form Gly and 
its isomer. Similar studies were undertaken by Nhlabatsi et al. 
(2016) with DFT approaches, Rimola et al. (2010, 2012) with a 
cluster approach employing DFT calculations at the B3LYP/6- 
314-G(d,p) level, Bossa et al. (2010), Oba et al. (2015), Sato 
et al. (2018), Joshi & Lee (2022), Thripati (2022) with DFT 
calculations and Krasnokutski et al. (2020) with DFT+experi- 
ments, all proposing pathways to Gly on interstellar ices/grains. 

In comparison to surface chemistry, the gas-phase reactions 
leading to the formation of Gly have been studied to a lesser 
extent. Pilling et al. (2011) used B3LYP DFT and MP2 
calculations to propose formation routes to Gly from carboxylic 
acids as well as carboxyl radical (COOH), both in the gas- 
phase and solid-phase. It was suggested that the most favorable 
gas-phase reactions were between acetic acid (CH;COOH) 
with NH* or NH5OH, or NH»CH, with COOH". Singh et al. 
(2013) reported that the simple molecules NH», CH», CH, and 
CO could form Gly in both the gaseous and grain phases, based 
on DFT calculations at the B3LYP/6-31G (d,p) level. 
Kayanuma et al. (2017) used the B3LYP functional and 
6-31G* basis sets to propose synthesis pathways to Gly from 
aminoacetonitrile (NH? CH5CN) and hydantoin (2,4-imidazoli- 
dinedione, C4H4N50;) through the Bücherer-Bergs reaction 
and hydrolysis in both the gas- and solid-phases. In addition, 
DFT calculations at the B3LYP/6-311++G(2df,p) level are 
also used to study the formation of Gly peptide chain via 
unimolecular reactions (Comte et al. 2023). 


4.2. Chiral Alpha-amino Acids 


The formation of chiral amino acids in the ISM is not as well 
understood as that of Gly, despite the fact that homochirality is 
a common property of amino acids in biosystems. Most of the 
research in this domain has been focused on relatively simple 
amino acids such as Ala and Serine (Ser). Elsila et al. (2007) 
used isotopic labeling techniques to observe multiple reaction 
pathways for the formation of Gly and Ser in interstellar ice 
analogs, suggesting that the formation of amino acids is not 
narrowly dependent on ice composition. 

Shivani et al. (2017) suggested, through DFT calculations, 
that the reaction between CH3CN, CH3CNH» and HCOOH 
could lead to the formation of Ala. The same group explored 
the possibility of Ser formation in the ISM through detected 
interstellar molecules such as CH, CO and OH by radical- 
radical and radical-neutral interactions in the gaseous phase 
using DFT at the B3LYP/6-311G+(2df,2p) level (Shivani 
et al. 2014). The results were calibrated with a higher-level 
M06 and M06-2X functional, suggesting that the proposed 
reactions with low potential barriers could occur in the ISM. 
Rani & Vikas (2018) present a study of the pathways of 
stereoinversion in l-threonine, an amino acid with two 
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stereocenters, employing DFT at the M06-2X/aug-cc-pVTZ 
level. A simultaneous intramolecular proton and hydrogen 
atom transfer is observed to drive the stereoinversion, and the 
reaction rates in the temperature range of 500-1000 K are 
found to be significant. In a recent study, Liao et al. (2023) 
employed DFT at the M06-2X/6-31--G(d,p) level to examine 
the interstellar formation of Ala, Ser, and isovaline, with a 
focus on the key step in the synthesis pathway that determines 
their chirality. 


5. Sugars and Other Biology-relevant COMs 


The presence of sugars such as ribose and xylose has been 
identified in meteorites (Laneuville et al. 2018; Smith et al. 2020), 
prompting exploration into the formation of these molecules in 
star-forming regions. Woods et al. (2013) used DFT calculations 
to examine the dimerization of the formyl radical and its 
implications for the formation of glycolaldehyde (C2H40;), the 
simplest sugar-related molecule. The results suggest that the 
transition state for dimerization is located within the formyl 
radical, and that the reaction is energetically favorable under low 
temperature conditions, consequently leading to the formation of 
glycolaldehyde. This finding has implications for the formation 
of monosaccharides, disaccharides, and polysaccharides, all of 
which are important for astrochemistry and astrobiology. 

Thripati & Ramabhadran (2017) investigated the role of 
metal ions and hydrogen bonds in the formose reaction, the first 
step in the synthesis of sugars, utilizing DFT at the B3LYP/ 
aug-cc-pVTZ level. The simulations revealed that metal ions 
serve as catalysts in the formose reaction, while hydrogen 
bonds play a crucial role in stabilizing the reaction 
intermediates. Vazart et al. (2018) and Skouteris et al. (2018) 
studied the gas-phase formation routes for glycolaldehyde, 
acetic acid and formic acid, also known as the ethanol tree, 
through a combination of infrared spectroscopy and DFT 
calculations at the B2PLYP/m-aug-cc-pVTZ level. It was 
observed that radiative association is the principal mechanism 
for the formation of glycolaldehyde and acetic acid, while gas- 
phase elimination is the primary route for formic acid. Jeilani & 
Nguyen (2020) investigated the autocatalytic formose reaction 
and its potential role in the formation of RNA nucleosides 
using DFT at the B3LYP/6-311G(d,p) level. Results indicated 
that the autocatalytic formose reaction is capable of forming 
several RNA nucleosides, including Cyt and Ura. 

A study by Woon (2021) relied on DFT calculations to 
investigate the formation of glycolonitrile (HOCH2CN) from 
reactions between C* and HCN and HNC on icy grain mantles. 
It was demonstrated that the reactions can contribute to the 
formation of the molecule and that the reaction between C* and 
HCN is a major contributor in interstellar regions. Ahmad et al. 
(2020) presented a theoretical approach to elucidate the 
formation of C;H4O; isomers in the ISM through reaction of 
interstellar formaldehyde molecules, utilizing DFT molecular 
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dynamics simulations. The results revealed that the reaction 
between formaldehyde molecules is exothermic and occurs in a 
concerted way to produce three isomers: glycolaldehyde, 
methyl formate and acetic acid. 

DFT calculations have been used to investigate the formation 
of other biologically relevant COMs in the ISM. Xie et al. 
(2022) proposed a new pathway in the ISM, leading to the 
formation of biorelevant molecules carrying amine groups or 
peptide bonds via a single-photon ionization induced Michael/ 
cyclization reaction of acrylonitrile (AN)-alcohol heterodimer 
complexes in the gas phase, combining experiments and DFT 
calculations. Ahmadvand et al. (2014) investigated the PES of 
cyclopropenone (c-H5C3O) formation using the B3LYP DFT 
and found the spin-allowed pathway of c-H5C3O formation to 
be more energetically favorable than the spin-forbidden 
pathway under interstellar conditions. 

Nitrogen-substituted polycyclic aromatic hydrocarbons 
(NPAHS) exhibit strong emission features in the infrared spectral 
region, and are of importance for astronomical observations 
aiming at the abiogenesis question as they could be building 
blocks of biomolecules (Kovacs et al. 2020; Meng et al. 2021, 
2023). Parker and coworkers demonstrated the possibility of 
forming NPAHs from precursor molecules in the ISM through a 
combination of DFT calculations and experiments (Parker et al. 
2015a, 2015b, 2015c). Furthermore, molecular dynamics 
simulations employing DFT-fitted empirical reactive potentials 
were utilized to investigate the formation of interstellar PAHs 
(Qi et al. 2018; Hanine et al. 2020). Finally, DFT calculations at 
the B3LYP/6-311++G™ and 6-31--G** level were used by 
Yang et al. (2022) to examine the interaction between PAHs and 
organic molecules for forming PAH-organic molecule clusters. 
The results suggested that the PAH-organic molecule clusters 
were formed through hydrogen bonding, with the formation of 
clusters being more favorable when PAHs were more convex 
and highly functionalized. 


6. Astronomical Implications 


Using DFT, one can calculate the free energy of reactants, 
intermediates, transition states or products assuming non- 
interacting particles at a given thermodynamic temperature 
(McQuarrie & Simon 1999). For instance, one could use the 
average temperature of a specific phase of a star-forming 
region. These calculations can determine the potential barriers 
and assess the feasibility of a proposed reaction in a given 
astronomical environment. There are two common strategies 
for estimating the barrier, each leading to a different value. One 
approach assumes that the energy does not leave the system by 
radiation during atomic rearrangements in a reaction, while the 
other considers that energy dissipates after each step in a 
pathway. Under the former assumption, the barrier is defined as 
the difference in free energy between the highest-energy state 
and the RC, while in the latter, it is defined as the energy 
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Figure 5. (a) Reaction rate coefficient k and (b) half-life time t;/5 vs. temperature T for binary reactions with different potential barriers AG. 


needed to overcome the most costly step. Although the true 
situation lies somewhere between these idealized scenarios, the 
former approach is typically used for interstellar reactions in 
the gas phase because the atomic rearrangement is usually 
faster than the vibration cascade. 

The potential barrier is crucial in determining whether a 
reaction is feasible in the ISM. As temperature increases, 
molecules collide more frequently and with greater kinetic 


energy. The proportion of collisions that surpasses the barrier 
increases with temperature, following the Arrhenius relation. 
The relationship between the reaction rate constant k, the half- 
life time of reactants 1/5, the temperature and the barrier AG is 
described in the transition state theory (Eyring 1935), 
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Reference 


Gupta et al. (2013) 
Choe (2018) 
Choe (2020) 
Choe (2021) 

Nguyen et al. (2015) 
Jeilani et al. (2016) 
Wang & Bowie (2012) 
Benallou (2019) 
Jung & Choe (2013) 
Wang et al. (2013) 
Jeilani et al. (2018) 
Lu et al. (2021) 
Yang et al. (2023) 
Jeilani & Nguyen (2020) 


Nhlabatsi et al. (2016) 
Rimola et al. (2010) 
Rimola et al. (2012) 

Sato et al. (2018) 

Krishnan et al. (2017) 


Singh et al. (2013) 
Kayanuma et al. (2017) 
Shivani et al. (2017) 
Shivani et al. (2014) 


Thripati & Ramabhadran (2017) 
Jeilani & Nguyen (2020) 
Jeilani & Nguyen (2020) 

Ahmad et al. (2020) 
Ahmadvand et al. (2014) 
Parker et al. (2015c) 


Table 1 
Potential Barriers AG (in kcal mol!) of Previously Proposed Reactions for Synthesizing Biologically Relevant Species in ISM Using DFT 
Nucleobases 
Species AG Type Level of theory 
Cyt «12 rr/r-m B3LYP/6-311+G** B3LYP/6-31G** 
Gua barrierless m-m(c) B3LYP/6-311G(2d,d,p) 
Cyt/Ura/Thy barrierless i-m B3LYP/6-311G(d,p) 
Cyt barrierless i-m B3LYP/6-311G(d,p) 
Cyt/Ura/Thy 3-7 rr B3LYP/6-311G(d,p) 
Ade/Pur/Cyt/Ura 14-37.3 rr/r-m B3LYP/6-311G(d,p) 
Cyt/Ura/Thy 11.2 rm B3LYP/6-3114-4-G(d,p) 
Ade 54-57 m-m B3LYP/6-311G(d,p) 
Ade 70.5 m-m B3LYP/6-311G(2d,d,p) 
Ade 25.7 m-m(c) B3LYP/6-311G(d,p) 
Ade/Gua/Pur 4.3-29.8 rc UB3LYP/6-311G(d,p) M062x/6-311G(d,p) 
Cyt/Thy/Ura 2113 r-r/r-m/m-m M06/6-314-G(d,p)/6-3114-4-G(d,p) 
Ade/Gua >14.1 m-m(c) M06-2X/6-31--G(d.p) 
Ade/Gua/Cyt/Ura 33.1 rm B3LYP/6-311G(d,p) 
Amino acids 
Gly 71.4 m-m B3LYP/6-31++G(d,p) 
Gly 39.2-40.9 rm B3LYP/6-31--G(d,p) 
Gly 12 rm BHLYP/6-311++G(d,p) 
Gly 1.9-8 rr/r-m UB3LYP/6-311++G™ 
Gly 18.3-32.9 m-m(c) B3LYP/6-31+-+G(3df,2pd) 
Gly 15-73.8 rr/r-m B3LYP/6-31G (d,p) 
Gly 38-39 m-m(c) B3LYP/6-31G* 
Ala 2.1-6.8 rr/r-m B3LYP/6-311G(d,p) 
Ser 12-15 rm B3LYP/6-311G+(2df,2p) 
Sugar and others 
C;5H40; «89 i-m B3LYP-D3/aug-cc-pVTZ 
C2H402 21.3-23.2 r-r/i-m B3LYP/6-311G(d,p) 
Ribose 25.4-32.7 i-m B3LYP/6-311G(d,p) 
C2H402 44-95 m-m B3LYP/6-311G(d,p) 
c-H2C30 <45 r-m/m-m B3LYP/cc-pVTZ 
NPAHs barrierless rm B3LYP/6-311+G* 
Pyridine barrierless rm B3lYP/aug-cc-pVTZ 


Parker et al. (2015b) 


Note. Abbreviations for reaction type: r-r (radical-radical), m-m (molecule-molecule), i-m (ion-molecule), r-m (radical-molecule), c (catalyzed). 


where C is the transmission coefficient, kg the Boltzmann 
constant, T the absolute temperature, A the Planck constant and 
R the molar gas constant. The results plotted in Figure 5 show 
that, for achieving a half-life time shorter than 10? yr, a typical 
astrochemical time frame as highlighted by the green horizontal 
line in the lower panel, the reaction barrier needs to be lower 
than ca. 35 kcal mol! at 300 K. This limit roughly corresponds 
to a rate coefficient greater than 1071? cm? s^ !, as highlighted 
by the horizontal line in the upper panel. We note that the here- 
considered reactions are two-body collision processes, or 
catalyzed binary reactions in which the catalyst’s concentration 
greatly exceeds that of the two reactants (in the so-called 
pseudo-binary reactions). 

According to Figure 5, a potential barrier lower than ca. 
35 kcal mol ! signifies thermal feasibility in the formation 
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regions of young stellar objects, which are often associated 
with warm ambient gas at average temperatures up to 300 K, 
commonly referred to as hot cores or corinos. These regions 
typically exhibit a rich organic chemistry and contain the initial 
materials required for the formation of organic compounds 
(Herbst & van Dishoeck 2009). Note that a common mistake to 
avoid here is attempting to determine the activation temperature 
by a direct conversion of 35 kcalmol ! to its equivalent 
temperature (yielding ca. 17,613 K). This approach overlooks 
the fact that the molecules undergo reactions via collisions, and 
their energies are distributed with Boltzmann populations. 
Furthermore, the protoplanetary disk is of particular interest in 
studying abiogenesis since the chemical composition of the 
planets formed within it is believed to be inherited from the 
disk itself. The temperature of the disk gradually increases from 
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10 K to several thousand K, ranging from the outer layer to the 
central forming star (Akimkin et al. 2013). 

Table 1 summarizes the DFT-calculated potential barriers of 
previously proposed reactions for synthesizing biologically 
relevant species in ISM. For synthesizing complex biomole- 
cules from closed-shell species, the potential barrier AG is 
typically greater than 50 kcalmol '. However, reactions 
involving at least one transient species (atoms, radicals, ions) 
have much lower barriers and are feasible at the temperatures 
found in star-forming regions. Alternatively, catalyzed 
reactions have been proposed for the interstellar formation of 
COMs. Catalysts can facilitate difficult reactions between 
closed-shell species, making them easier to proceed under 
laboratory conditions. Various free radicals, ions, acids and 
substrates have been suggested as catalysts for the synthesis of 
interstellar COMs (Mendoza et al. 2004; Roy et al. 2007; 
Rimola et al. 2012; Jeilani et al. 2013; Saladino et al. 2013; 
Vinogradoff et al. 2015; Jeilani et al. 2016; da Silva & de 
Araujo 2017; Qi et al. 2018; Potapov et al. 2019; Hanine et al. 
2020). Notably, neutral atomic hydrogen H I has recently been 
proposed as an efficient catalyst for the interstellar synthesis of 
Ade and Gua (Yang et al. 2023). As the most abundant free 
radical in the Universe, it is likely that HI acts as a general 
catalyst for the formation of interstellar organics. 


7. Conclusion and Perspective 


This mini-review delves into the application of DFT in 
studying complex interstellar chemistry, particularly for 
understanding the formation of biologically relevant com- 
pounds. DFT calculations have proven valuable in elucidating 
the mechanisms behind the formation of these compounds in 
the ISM, either guiding experimental investigations or 
predicting reaction pathways. Despite these advancements, 
numerous unexplored areas remain. In our perspective, future 
research avenues should encompass the exploration of complex 
purine base formation, the elucidation of chiral amino acid 
synthesis pathways, comprehension of intricate sugar and 
biomolecule production, examination of the influence of metal 
ions and hydrogen bonding in ISM chemical reactions, 
refinement of computational models, and assessment of the 
impact of temperature and environmental variables on 
interstellar biomolecule formation. Furthermore, the integration 
of machine learning techniques offers intriguing possibilities. 
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